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Abstract —The IEEE 1588 protocol has received recent interest 
as a means of delivering sub-microsecond level clock phase 
synchronization over packet-switched mobile backhaul networks. 
Due to the randomness of the end-to-end delays in packet 
networks, the recovery of clock phase from packet timestamps in 
IEEE 1588 must be treated as a statistical estimation problem. 
A number of estimators for this problem have been suggested 
in the literature, but little is known about the best achievable 
performance. In this paper, we describe new minimax estimators 
for this problem, that are optimum in terms of minimizing the 
maximum mean squared error over all possible values of the un¬ 
known parameters. Minimax estimators that utilize information 
from past timestamps to improve accuracy are also introduced. 
Simulation results indicate that significant performance gains 
over conventional estimators can be obtained via such optimum 
processing techniques. These minimax estimators also provide 
fundamental limits on the performance of phase offset estimation 
schemes. 


1. Introduction 

In modern 4G cellular networks, precise synchronization 
between base stations is critical for ensuring seamless han¬ 
dovers, reducing interference and improving capacity. Given 
the high cost and effort associated with global positioning 
system (GPS) based synchronization, carriers often find it 
preferable to deliver timing via the mobile backhaul network. 
Since these backhaul networks are typically packet switched in 
nature, a popular timing approach (T] is to use SyncE OO 
for frequency synchronization and IEEE 1588 HI for phase 
synchronization. The topic of phase synchronization is the 
focus of this paper. A related requirement arising from 4G 
LTE (Long Term Evolution) networks is that neighboring base 
stations must be synchronized to within 1.25 jis of absolute 
phase error, to ensure efficient operation in the time division 
duplexing mode. 

In the IEEE 1588 precision time protocol (PTP), a master 
and a slave node exchange a series of packets to achieve 
phase synchronization. Packets traveling between the master 
and the slave encounter several intermediate network nodes 
such as switches or routers, accumulating random queuing 
delays at each node. The problem of finding the slave’s 
phase offset from the timestamps of the exchanged packets, 
while combating the random queuing delays, is referred to as 
phase offset estimation (POE). The PTP standard and related 
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literature prescribe the use of simple estimators such as the 
sample mean, minimum and maximum filters for POE. Several 
recent papers Isl-ITTI have studied methods to improve the 
performance of these filters, especially in the presence of large 
queuing delays due to high network loads. However, it is not 
well understood as to how close these POE schemes come to 
achieving the best possible performance. 

In this paper, we derive optimum estimators for the problem 
of POE, which, to our knowledge, have not been described 
previously in literature. To this end, in Section |IJ we begin 
by modeling POE as a non-Bayesian estimation problem. 
Specifically, we treat the phase offset as an unknown determin¬ 
istic parameter to be estimated from timestamps that are also 
affected by the fixed delays along the forward and reverse 
network paths. We then consider three observation models, 
with the degree of information available about the fixed 
delays varying between these models. The first model assumes 
complete knowledge of both the fixed delays, while the second 
model assumes only that the difference between the fixed 
delays, i.e. the delay asymmetry, is known. The third model 
assumes known delay asymmetry, as well as the availability 
of additional past observations which contain the same fixed 
delays but different phase offsets. We show that POE under all 
three models falls under a general class of estimation problems 
known as vector location parameter problems. In Section 
[In| for this general problem class, we derive the optimum 
estimator that minimizes the maximum mean squared error 
{maximum MSE) over all values of the unkown parameters, 
and is hence minimax optimum. This minimax estimator is an 
extension of the well-studied Pitman estimator C3, which is 
known to be minimax optimum for scalar location parameter 
problems. Other properties of the minimax estimator, related 
to the estimation of linear combinations of parameters, are also 
derived. 

In Section |IV| we simplify the general minimax estimator 
for the problem of POE under each observation model. In 
Section |Vj using the properties of the minimax estimator 
derived in Section III we show that under typical network 
assumptions, the MSE of the minimax estimator grows at least 
linearly with the number of intermediate nodes between the 
master and the slave. Our simulations in section IvJ compare 


the performance of the new minimax estimates against con¬ 
ventional estimators under several network conditions. Results 
indicate that there are several network scenarios where conven¬ 
tional estimation schemes fall significantly short of achieving 
the maximum possible synchronization accuracy. Eurther, in 
asymmetric network traffic scenarios, we show that significant 
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performance gains become available if we exploit information 
about fixed delays from past observations. 

The results in this paper extend our previous works caci, 
where lower bounds on the maximum MSE of POE schemes 
under the second observation model were derived. In this 
paper, we address more observational models, provide the 
tightest lower bounds on the maximum MSE of POE schemes 
under each model, and also specify the estimators that achieve 
these lower bounds. 


for i = 1, • • • , P. These observations can be collected to 
obtain the vector observation model 

y = 6e-\-w (6) 

where 

y = [yi yi]^ > Yk = [yi,k ■ ■ ■ yp,kf 0) 

W = [w]' , Wfe = [wi^k ■ ■ ■ Wp^k]^ (8) 

e = [Ip (—Ip)]"’’ (9) 


II. System Model 

Consider a scenario where the slave clock has a phase offset 
6 and zero frequency offset with respect to its master. To help 
the slave determine 6 , the IEEE 1588 PTP protocol allows 
a two-way message exchange between the master and slave. 
Eour timestamps are available to the slave after each two-way 
message exchange (for more details, see US): 
ti : Time of transmission of SYNC packet by master. 
t 2 : Time of reception of SYNC packet at slave. 
ts : Time of transmission of DELAY_REQ packet by slave. 
^4 : Time of reception of DELAY_REQ packet by master. 

In order to estimate 5, it is clearly sufficient for the slave 
to only retain the pair of timestamp differences 


and 1 AT is a X 1 vector with all elements equal to 1. 
2) Standard model (S-model)\ Here we assume that only the 
difference between d^^^ and referred to as the delay 
asymmetry, is known to the slave. By compensating the 
observations as 

yi,i = ^*1, Via = yU - df^ + df" (10) 
we obtain 

yi,i = d^ S+ Wi^i , yi ,2 = d - S ^ Wi ^2 (11) 

for i = 1, • • • , P, where d^^^ = d. These observations 
can be denoted vectorially as 

y = dl 2 p + + w = A6 + w (12) 


yi = t2-ti=di+S (1) 

y2 = U-h=d2-5 (2) 

Here di and d 2 denote the end-to-end (ETE) network delays 
in the master-slave and slave-master directions, respectively. 
Assume for simplicity that a common network path is taken 
by all packets traveling between the master and the slave and 
vice-versa. Then each ETE delay receives contributions from 
three factors: 

(a) Constant propagation delays along network links between 
the master and the slave (or vice-versa). 

(b) Constant processing delays at intermediate nodes (such as 
switches or routers) along each network path. 

(c) Random queuing delays at intermediate nodes along each 
network path. 

Hence each ETE delay can be modeled as 

= + d 2 =dt^^W 2 (3) 

Here and denote fixed delays corresponding to the 
sum of the constant propagation and processing delays, while 
wi and W 2 model the random queuing delays. 

Assuming the values of S, d^^^ and d^^^ remain constant 
over the duration of P two-way message exchanges, we can 
collect multiple observation pairs (^ 1 ,^ 2 ) to help estimate 6 . 
We denote these observations as 

yh = df"^ + ^ , yl 2 = df^ - 6 + (4) 

for i = 1, • • • , P. We now consider three observation models: 
1) Known fixed delay model (K-model): Here we assume 
that d^^^ and d^^^ are fully known at the slave. Hence, 
setting yi^k = yl^k ~ d^^^ we obtain the compensated 
observations 


where y and w are as defined in Q and and 


6 = [ 0 i 62 ]^ = [d-y 6 d- 


A = 


Ip Op 

Op Ip 


(13) 

(14) 


with 1 q, Oq represent Q x 1 vectors of ones and zeros, 
respectively. 

Note that this model also covers the case of symmetric 
path delays, where d^^^ = d^^^, and hence the delay 
asymmetry is zero. We further note that other cases where 
the relationship between the fixed delays is known, such 
as the case where the ratio d^^^/d^^^ is known, can also 
be handled using a model similar to ( p^ . Eor brevity, 
only the case of known delay asymmetry is considered 
here. 

3) Multiblock model (M-model): Here we assume, as in the 
standard model, that the delay asymmetry is known to 
the slave. Suppose we refer to a set of P observation 
pairs as a block. In this model, we further assume that in 
addition to the current block, we have observation pairs 
from B previous blocks available to us. The phase offset 
6 is modeled as being constant for all observation pairs 
within each block, but varying between different blocks. 
The fixed delay d is modeled as constant across all P +1 
blocks. This model is representative of scenarios where 
changes in the fixed delay occurs over longer time scales 
than changes in phase offset. We denote observations 
pairs in past blocks using the notation 


yi,j,i — d ^ 5j Wij^i 


yi,j,2 


= d — Sd 


■ '^*,7,2 

(15) 


and observation pairs in the current block as 


= -S^ Wi^2 ( 5 ) 


yi,i = dS ^ Wi^i , yi,2 = d - 6 ^ Wi^2 ( 16 ) 


1 =S-\- Wi^i , yi^2 
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for i = 1, • • • , P and j = 1, • • • , P. 

We thus obtain the vector observation model 

y = GO + w (17) 

where 

y=[yiy2]^> (is) 

y*; = [yi.fc ■■■ yp,k yi,i,k yi,2,k yB,p,k]^ (19) 

W=[w]'w|]^, (20) 

Wfe = ■ ■ ■ Wp^k Wi,l,k Wia,k ■ ■ ■ WB,P,kV 

( 21 ) 

d = [d5 5i ■■■ (22) 

G = [I 2 BP Z ® Ip], Z = [Ip (-Ip)]’’’ (23) 

and Ib, ^ denote the identity matrix of size B and the 
Kronecker product operator, respectively. 

In practice, the S-model and M-model are more practical 
than the K-model, since they only assume that the difference 
between the fixed delays is known. We still consider the K- 
model in this paper since it provides us with useful limits on 
the performance of optimum estimators under the M-model. 

Given either of the observation models, the problem of POE 
is to estimate 6 from the observation vector y. Here we further 
make the following assumptions: 

(i) All the queuing delays are strictly positive random vari¬ 
ables that are mutually independent. 

(ii) All forward queuing delays share a common pdf 
Similarly the reverse queuing delays share a common pdf 
f2{w). 

(iii) The maximum possible value for a forward or reverse 
queuing delay is finite. 

(iv) All the unknown fixed delays and phase offsets are 
deterministic parameters, i.e. no probability distributions 
for these parameters are known a priori. 

III. Minimax Estimation for General Location 
Parameter Problems 

We now consider a general class of estimation problems, 
where the effect of the unknown parameters is to shift the 
location of the pdf of the observations without modifying the 
underlying shape of the pdf. The POE problems under all three 
observation models considered in Section |I^ belong to this 
general class of problems. The general results derived here 
shall be applied to the POE models in Section]^ The proof of 
all the lemmas and theorems stated in this section are provided 
in the appendix. 

We first define the general class of problems we are inter¬ 
ested in studying. 

Definition 1 (Vector Location Parameter Problem): 

Suppose we want to estimate a linear combination c?6 of the 
unknown parameters contained in 0 G (where c G 
is a constant vector), based on observations x G If the 
observations have a pdf of the form 


(24) 


for some N x M matrix G and function fo{’), then we 
shall refer to such an estimation problem as a vector location 
parameter problem. 

All the definitions and theorems in the remainder of this 
section apply specifically to this vector location parameter 
problem. The results we derive further require that the function 
/o (x) be non-zero over a bounded, positive range of values of 
its arguments, as defined below. 

Definition 2 (Finite Support): We say that /o(x) in p4| ) has 
finite support if there exists a finite L > 0 such that /o(x) = 
0 whenever all the elements of the vector x lie outside the 
interval [0,1/]. 

It is typical in statistical literature to characterize the per¬ 
formance of an estimator via the mean squared error (MSB) 
metric. There are three ways to define the MSB metric: 

1) The conditional MSE 

T^{yiX):Q)= ( [fi'(x) - c'^0]^/(x|0)dx (25) 

2) The maximum MSE 

M{g{'x.))= sup n{g{'x.),e) (26) 

3) The average MSE 

S(5'(x),p(0)) = / TZ{g{x),e)p{e)dd (27) 

Jrm 

where p{6) is a prior distribution defined over 6 G 
In this section, we consider the problem of finding estimators 
that are optimum in terms of minimizing the maximum MSE, 
and refer to such estimators as minimax estimators. The 
definitions of the conditional and average MSEs shall be used 
in the proofs of the optimality of the minimax estimator. 

We now consider a class of estimators known as shift 
invariant estimators, defined as follows. 

Definition 3 (Shift Invariant Estimator): We say that an es¬ 
timator ^(x) of c^O is shift invariant if for the same matrix 
G used in ( |24| ), 

^(x + Gh) = ^(x) + c^h (28) 

for all h G 

While the conditional, maximum and average MSEs are in 
general different for a estimator, for a shift invariant estimator 
they are always equal, as stated in the following lemma. 

Lemma 1: Any shift invariant estimator ^(x) of cf 6 has a 
conditional MSE that is constant with respect to 6, and satisfies 

n{g{x.),e) = M(g(x)) = B(g(x),p(d)) (29) 


for any choice of prior distribution p(0). 

We now give the expression for the minimax estimator and 
prove its optimality using Definition and Lemma 

Theorem 1 (Minimax estimator): If /o(x) has finite sup¬ 
port, then the estimator 


^ lRM[c^o]f{x\e) de 

q (x) = ^^— 

Srm /(x| 6>) d6> 

satisfies the following properties: 

(i) g*(x) is shift invariant. 


( 30 ) 


/(x|0) = fo(x-G0) 
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(ii) 5 '*(x) is a minimax estimate of c^O. 

(iii) Among all estimators of c^O that are shift invari¬ 
ant, ^*(x) achieves the minimum conditional MSE 
n{g{x.),6). 

(iv) ^*(x) is unbiased, i.e. E { [^*(x) — \ = 0. 

An interesting property of the minimax estimator is that 
for a given set of observations, the minimax estimate of a 
linear combination of parameters is identical to the same 
linear combination of the minimax estimates of each of the 
parameters. Eormally, this can be stated as follows. 

Lemma 2: Let 6 = [0i • • • and let ^*(x) represent 

the minimax estimate of 6^^. If c = [ci • • • cm]^, then the 
minimax estimate ^*(x) of c^O satisfies 


where 


p 

/w(w) = P[ fl{Wi^i)f2{Wi^2) (35) 

i=l 

Hence, according to Definition this is a vector location 
parameter problem. Thus, using Theorem we obtain the 
minimax estimator of S as 


.. . ^ /R<^/w(y-fc)d (5 
/k/ w(y - (5e)d(5 


(36) 


2) Standard Model: As stated in ([ig, here the pdf of the 
observation vector y has the form 


M 


5*(x) = ^Ci5*(x) 


/(y|^) = /w(y - A0) (37) 

= /w,i(yi - ^iip)/w, 2 (y 2 “ ^ 2 ip) (38) 


This property will allow us to simplify the form of the 
minimax estimator under the S-model in Section [Ivl Another 
interesting property of the minimax estimator emerges when 
we consider multiple minimax estimates, each based on a 
different observation vector. Here we can show that the sum 
of the MSEs of the individual minimax estimates will always 
be less than the MSE of the minimax estimate based on sum 
of all the observation vectors. This can be formally stated as 
follows. 

Theorem 2: Let xi, • • • , x^ be A^-dimensional random 
vectors with pdfs of the form 

/(x/c|0/c) = //c(x/c - GkOk) (31) 

where fk{') has finite support for /c = 1, • • • ^K. Assume that 
xi, • • • , xx are all mutually independent conditioned on the 
unknown parameters, i.e. the joint pdf 
satisfies 

= /(Xfei|0fei)/(xfc2|0fej (32) 

for all values of ki and k 2 . Let hl{-Kk) denote the minimax 
estimate of Eurther, let x = Ylk=i ^ = Ylk=i 

and let ^*(x) denote the minimax estimate of c^O from x. 
Then ^*(x) satisfies 

K 

M{g*{x)) > (33) 

k=l 


where 


p 

/w(w) = '[[fl{Wi^i)f2{Wi^2) , 
i=l 
P 

= '[[fk{u>i,k) for A: = 1,2 

i=l 


(39) 

(40) 


Hence, according to Definition this is a vector location 
parameter problem. Our goal is to estimate 6 = 6 (where 

c = [0.5 —0.5]^) from the observation vector y = A0+w. 
Hence, using Theorem we obtain the minimax estimate 


^ ^ /R 2 [c^g]/(y|^) de 
/r. fiy\o) de 


(41) 


Using Lemma we can further simplify the estimator as 


/R(*i/w,i(yi -(*ilp) d6>i 

/R/w,i(yi -dilp) d9i 

_ /R^2/w,2(y2 - (*21 f) d6>2 

/r /w,2(y2 - ^21 f) d02 

3) Multiblock Model: As stated in ( p^ , here the pdf of the 
observation vector y has the form 

/(y|^) = /w(y - G0) (43) 



This property will be useful in proving certain properties of 
the minimax estimator for POE in Section [Vl 

IV. Simplification of Minimax Estimator for the 
POE PROBLEM 

We now use the results in Section |nl| to obtain minimax 
optimum estimators under the three POE observation models 
discussed in Section]^ and simplify the resulting expressions. 

1) Known fixed delay model: As stated in ([^, the pdf of the 
observation vector y has the form 


where 


B P 2 

/w(w) = nnn fk{'^i,j,k) (44) 

i=lj=lk=l 

Hence, according to Definition this is also a vector loca¬ 
tion parameter problem. Our goal is to estimate 6 = 6 

from y, where c = [0 1 0 —• 0 ]^. Using Theorem |l| we 

P — 1 zeros 

obtain the minimax estimate 


^(y) 


4^r(^,y)d,5 

/Rr(< 5 ,y)d <5 


/(y|<^) = /w(y - Se) 


(34) 


(45) 
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where 


r(( 5 ,y) = / 

Jr 


P 2 

li=i k=i 

■ ^{d,y) d{d) 


^jid,y)= n 

i=i 


P 2 


(46) 


Jr ■ _ 1 -1 


^ i=i k=i 


(47) 


In scenarios where analytical expressions for the queuing 
delay pdfs /i (w) and /2 (w) are known, it might be possible to 
further simplify the integrals in ( [3^ , ( [42| ) and ([45])-([47)). In the 
more general case of arbitrary pdfs /i {w) and /2 (w), these in¬ 
tegrals can be computed by approximating them with Riemann 
summations. In such cases, the computational complexity 
associated with the minimax estimators will depend on the 
number of bins used in the Riemann summations. Typically, 
this computational complexity is significantly higher than that 
of conventional estimators such as the sample minimum, mean, 
median or maximum estimators. 

Due to the nature of the POE observation models, some 
comments on the MSE of the minimax optimum estimator (or 
the minimax MSE) can be made directly, without requiring 
numerical evaluations. Eirstly, the minimax MSE under the 
K-model is guaranteed to be lower than that under the S- 
model or M-model, since the nuisance parameter d is absent 
from the K-model. Eurther, the minimax MSE under the M- 
model is guaranteed to be lower than that under the S-model, 
since the M-model has additional information from past blocks 
available to it. This past information can be used to reduce 
the uncertainty associated with the nuisance parameter d, and 
hence improve the estimate of S. 


V. Minimax MSE under IID single-node queuing 

DELAYS 


Assume that the same is true in the reverse direction as well. 
Then p{N) satisfies 


p{KL) > Kp{L) 


(48) 


where K and L are any two positive integers. 


Eor L = 1, the relation in (48) reduces to p{K) > Kp{l), 


which essentially implies that in networks with Ltd. single- 
node queuing delays at all intermediate network nodes, the 
minimax MSE grows at least linearly with the number of 
nodes. This interpretation can be especially useful for network 
designers, since it provides a computationally simple upper 
limit on the number of nodes that can be allowed between 
the master and the slave for a given synchronization accuracy 
requirement. A typical example where independent, identically 
distributed single-node queuing delay distributions can be 
assumed is a network in which only cross traffic flows (defined 


in Section [Vl| ) are present. Note that a relationship similar to 
( [48] ) can also be derived under the K-model and the M-model. 
Eor brevity, only the S-model is considered in this section. 


VI. Simulation Results 

We now compare the performance of conventional POE 
schemes against the newly derived minimax estimators. To 
this end, we consider a few network scenarios motivated by 
the ITU-T recommendation G.8261 O. The metric we use to 
quantify estimator performance is the maximum MSE. Eor 
brevity, we refer to the maximum MSE as simply the MSE 
throughout this section. 

We consider four commonly used conventional POE 
schemes, namely the sample minimum, maximum, mean and 
median filtering schemes. Given the observation vector y of 
either the K-model or the S-model, these schemes use an 
estimator of the form 

f ^ e(yi)-^(y2) . 49 ^ 

2 ^ ^ 


The performance of the minimax estimators described in 
Section [IV] depends on the nature of the network queuing 
delays, which in turn depends on the number of nodes present 
between the master and the slave. Theorem [2] can be used 
to obtain a simple relationship between the minimax MSE 
and the number of intermediate nodes, under certain network 
conditions. We state this relationship in the form of the 
following corollary to Theorem [^ with the proof provided 
in the appendix. 

Corollary 1: Consider a network consisting of a master 
and a slave separated by N nodes. Let p{N) represent the 
minimax MSE associated with POE under the S-model in this 
scenario, for a fixed number of two-way message exchanges. 
Let the single-node queuing delay refer to the queuing delay 
experienced by packets at any single nod^ Assume that the 
single-node queuing delays across all nodes in the forward 
direction are independent and identically distributed (i.i.d.). 

^Measurements of the single node queuing delay in the forward or reverse 
direction would corresponding to the proper entries of the vector w in 03 
for the case where only N=1 node is involved. 


where ^(x) denotes either the minimum, maximum, mean 
or median of the elements of the vector x. Under the M- 
model, these estimators behave exactly as under the S-model, 
discarding information from past blocks since they have no 
means of utilizing it. It is easy to show that these estimators are 
shift invariant under all three observation models. They also 
have an identical value for the MSE across all three models, 
given as 


M{5) =eU' 


where 


P = var < 5 


e = 


e = 


]- 


2 I 2 

— (J p 


= 4[var{C(wi)} + var{^(w2)}] 
represents the estimator variance, while 

M ^ E[^(wi)] -E[C(w2)] 


(50) 


(51) 

(52) 


(53) 
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Traf. Model 

Packet Sizes (Bytes) 

% of Load 

TMl 

{64,576,1518} 

{80%, 5%, 15%} 

TM2 

{64,576,1518} 

{30%, 10%, 60%} 


TABLE I: Models for composition of background traffic 
packets 


represents the estimator bias. Note that 

-E’[5'(wi)] = J ^(wi)/wi(wi)dwi , (54) 

var{^(wi)} = / {^(wi) - /wi(wi)dwi , 

J Wi 

(55) 

p 

/Wi(wj) = JJ fi{wij) (56) 

j = l 

It is easy to see from ( [53] ) that when the forward and reverse 
queuing delay distributions fi{w) and f 2 {w) are not identical, 
li can be non-zero, and hence have a significant contribution 
in the MSB expression in ( |50| . This can be avoided by 
subtracting out the bias, to obtain the unbiased estimate 

6 = 6- ij. (57) 


Hence, in our results, we measure the performance of con¬ 
ventional estimators as their MSB after their bias has been 
compensated. 

In order to obtain the queuing delay distributions, we 
consider a Gigabit ethernet network consisting a cascade of 20 
switches between the master and slave nodes. Each switch is 
assumed to be a store-and-forward switch, which implements 
strict priority queuing. We consider two types of background 
traffic flows in this network: 


1) Cross traffic flows: In such traffic fiows l^lfTTlI . fresh 
background traffic packets are injected at each node along 
the master-slave path, and these packets exit the master- 
slave path at the subsequent node (see 4-switch example 
in Big. [T^. The arrival times and sizes of the packets 
injected at each switch are assumed to be statistically 
independent of that of packets injected at other switches. 

2) Mixed traffic flows: Here a mixture of cross traffic fiows 
and inline traffic flows are present in the network. Inline 
traffic fiows ii are characterized by packets that are 
injected only at the first switch along the master slave 
path, and that travel along the same path as synchroniza¬ 
tion traffic through the entire cascade of switches (see 
4-switch example in Big. lb). 


With regard to the distribution of packet sizes in background 
traffic, we consider Traffic Models 1 (TMl) and 2 (TM2) from 
the ITU-T recommendation G.8261 lO for cross traffic fiows, 
as specified in Table |T| Bor inline traffic fiows, we consider a 
third traffic model where packet sizes are uniformly distributed 
between 64 and 1500 bytes [7]. We assume that the interarrival 
times between packets in all background traffic fiows follow 
exponential distributions. We refer to the percentage of the 
link capacity consumed by background traffic as the load. 
In order to achieve a particular load, we accordingly set the 
rate parameter of each exponential distribution. The queuing 


Master 


Background Traffic Sources 

I ■ ■ ■ 

li. 


^1 

1 

11 

1 

1 


Jl 


IV 

N 

X 

Switch 3 Switch 4 


Slave 


(a) Cross traffic flows 


Background 
I Traffic Source 


Background 
Traffic Sink 



Slave 


Switch 1 Switch 2 Switch 3 Switch 4 


(b) Inline traffic flows 

Big. 1: Examples of four switch networks with cross and 
inline traffic fiows. Red lines indicate network links, blue lines 
indicate the direction of background traffic fiows, and green 
line represents the direction of synchronization traffic fiows. 


delay distributions under a number of network scenarios are 
plotted in Big.|^ These distributions were obtained empirically 
using low-level queue simulations. Without loss of generality, 
we assume that fixed delay components of the ETE delays 
equal zero, hence the support of the queuing delay distributions 
always begins at zero in the plots. 

The MSE of various estimators under different observation 
models and network conditions are plotted versus the number 
of observation pairs/samples P in Bigs. [^ - [7 In order to 
compute the minimax estimates, the integrals in (^, ( |42| ) and 
© were replaced with Riemann sums. The spacing between 
adjacent Riemann summation bins was set to 0.001 /iS, to 
ensure that the additional error introduced due to the Rie¬ 
mann sum approximation is small relative to the MSE being 
computed. Burther, to facilitate comparisons against the LTE 
synchronization requirement of 1.25 /iS of synchronization 
accuracy, the estimation error standard deviation required so 
that the absolute estimation error lies under 1.25 /iS with a 
5-sigma level of certainty is also plotted over the curves. Here 
the 5-sigma level of certainty implies that on average, only 
about 6 out of 10^ estimates will have absolute estimation 
error that exceeds 1.25 /is. Some key observations we can 
make from the results are: 

1. Performance under symmetric cross traffic (Fig. Here, 
the gap between the K-model and S-model minimax esti¬ 
mators is negligible under all four loads (20%, 40%, 60%, 
80%) considered. Hence, under these network scenarios, 
there is little performance to be gained from the additional 
knowledge about fixed delays that the K-model provides 
over the S-model. Burther, while the sample minimum 
estimator performs near-optimally at 20% load, at higher 
loads none of the conventional estimation schemes come 
close to achieving minimax optimal performance. In fact, 
at 80% load, the minimax estimators achieve the LTE 
synchronization requirement using only about 200 samples. 
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while about 800 samples are required by the best conven¬ 
tional estimator. 

2. Performance under symmetric mixed traffic (Fig. [^: Here, 
there is a fair gap between the K-model and S-model 
minimax estimators under the lower load scenario of Fig. 

which disappears under the higher load scenario of Fig. 
Further, the S-model minimax estimator requires about 
50% fewer samples than the best conventional estimator, to 
achieve the LTE synchronization requirement under the low 
load scenario. Interestingly, the sample mean filter performs 
near-optimally under the high load scenario. This indicates 
that the performance gap between the best conventional 
estimator and the minimax estimator may need to be 
studied on a per-case basis, and predicting general trends 
might be difficult. 

3. Performance under asymmetric traffic (Figs. j^and0: Here 
there is a significant gap between the K-model and S- 
model minimax estimators, with the K-model minimax 
estimator requiring about 90% and 22% fewer samples than 
the S-model estimator in Fig. and Fig. |7j respectively, 
in order to meet the LTE synchronization requirement 
threshold. This is expected in cases where the queuing 
delay distribution in one network direction has significantly 
lower spread than in the other direction. In such cases, 
the MSB of conventional estimators, given b y ([5^ , is 
dominated by either the first or second term in (|52|) if one 
these variances is much larger than the other. On the other 
hand, the K-model estimator can utilize knowledge of the 
fixed delays to base its estimate on only the observations 
corresponding to the direction with lower variance, thereby 
eliminating large contributions to its MSB caused by the 
queuing delay distribution that has higher variance. 
Further, since the M-model estimator can use information 
from B past blocks to estimate the fixed delay, we expect it 
to achieve the performance of the K-model estimator in the 
limiting case where B ^ oo. In our simulations, we ob¬ 
serve that M-model minimax estimator closely approaches 
the K-model minimax estimator in performance for fairly 
small values of B (between 5 and 20). 

VH. Conclusions 

We derived minimax optimum estimators for a general 
class of location parameter problems, and applied them to the 
problem of phase offset estimation under multiple observation 
models. In cases where the pdf of the queuing delays are 
known, minimax estimators can be used to obtain the best 
possible estimation performance. The MSB curves of the 
minimax estimators can also serve as a design tool for practical 
synchronization deployments, by providing fundamental limits 
on POE performance for a given set of network conditions. 
Our simulation results indicate that conventional estimators 
can perform close to optimum in certain low-load scenarios 
with symmetric queuing delay distributions. However, opti¬ 
mum estimators appear to provide significant performance 
benefits in scenarios where the queuing delay distributions 
are asymmetric, a case that occurs frequently in practice. The 
results in this paper could help guide the development of new 


h 

5b 


POE schemes that address synchronization challenges arising 
in current and future generations of mobile networks. 


Appendix 

Proof of Lemma^ Eor any shift invariant estimator ^(x), 
we can show that if 61 and 62 are any two values of the 
parameter vector with h. = 61 — O 2 , then 


n{g{yL),ei) 

= [ [fl(x)-c'^(02 + h)]2/o(x-G(02 + h))dx 

JRN 


-j 

JR 


[fl(x - Gh) - c'^02]^/((x - Gh)|02)dx 
[5(x) - Jd2ff{x\d2)dx 


(using a change of variables) 

= ^(5(x),02) 


(58) 

(59) 

(60) 

(61) 


Hence g{'x.) has constant conditional MSB w.r.t. 0. Eurther, 
using the definitions of the maximum and average MSEs, we 
obtain 7^(fi((x),0) = M{g{x)) = I3{g{x.),p{e)). ■ 


Proof of Theorem 

(i) It is simple to show that g* (x) is shift invariant, since 


5 *(x + Gh) = 


/R«[c^^]/o(x + Gh-G 0 ) dd 
V/o(x + Gh-G0) de 
/RM[c^0]/(x|0-h) de 


/rm /(x|6> - h) d6> 
^ /RM[c^^]/(x|g) dd 
/rm /(x|0) de 
= fi'*(x) + c'^h 


c^h 


(62) 

(63) 

(64) 

(65) 


any choice of prior distribution p(0), any estimator 

^ of 


Eor —j -j... 

g{x.) of c^O satisfies 

M{g{x)) > supB{g{x),p{e)) 

p{0) 


> sup inf B{g{x),p{e)) = Bo 

p( 0 ) 9 W 


( 66 ) 

(67) 


Eurther, it can be proved (by contradiction) that 
A4(g(x)) = Bo holds if and only if ^(x) is minimax. 
Now consider the estimator g'^(x) of ( [3Q| ). Erom (67), 
we already have M.{g*{x)) > So- We shall now show 
that So > A4(g*(x)) also holds, hence proving that 
So = Ad(^*(x)), and thus that ^*(x) is minimax. 
Consider a sequence of prior distributions Pi{6), each 
uniformly distributed over a support set 0^ for i = 
1, 2, • • •, where 


ei = {6 : i-i) • 1m < 6/ < Mm} (68) 

Here the inequality < 0 < i • 1 m implies 

that all the elements of the vector 6 lie in the interval 
[— 7 , 7 ]. Given a prior distribution pi(0), the estimator that 







minimizes B{g{x.)^p{0)) is the minimum mean square 
error (MMSE) estimator, 

5i(x) = / [c^e]fi{d\x) de (69) 

where fi{0\x) represents the posterior pdf 

= /(x|6>)p^(0) ^ /(x|6>) 

/ 06 ®,/(x| 0 )Pi( 0 ) d0 /(x|0) d0 

(70) 


Hence we can write 


Bo = sup inf B(5 (x),p(0)) 

p(e) S(x) 


> infB(5(x),pi(0)) = B{gi{x.),pi{e)) 
9 


(71) 


Further, since /o(x) has finite support, we have 


lim ^i(x) = lim 

i—>-oo i—>-oo 


Lee, /(^l^) 


/eee(x)[c'^^]/o(^~^^) 

/0ee(x)/o(x-G0) d0 


where 


5*(x) 


©(x) = : (x — G0) > 0 and (x — G0) < L • Ijyj 

(72) 

and hence 

lim B(gi(x.),pi(d)) 

1^00 

= lim B{g*{x),pi{e)) (73) 

1^00 

= lim Al(^*(x)) (Since ^*(x) is shift invariant) 

i^oo 

(74) 

= M(^*(x)) (75) 

From ( [tT) and we obtain Bq > M(g''(x)), hence 
completing the proof. 

(iii) Since (x) is shift invariant, from Lemma we have 
7Z(g(x), 6) = A4(g(x)). Further, since all shift invariant 
estimators have constant conditional MSE, and g'^(x) 
minimizes A4(g(x)), it also minimizes lZ(g(x)^6) for 
every value of 9. 

(iv) We shall prove that ^*(x) is unbiased by contradiction. 
Assume ^*(x) is biased. Since ^*(x) is shift invariant, 
its bias should be constant with respect to 9. Let 

/3 = E { [g*{x) - I 0} = E {g*{x) \ 0 =m} 

(76) 

denote this constant bias. Now consider a new estimator 
of c^9, given as 

5(x)=5*(x)-^ (77) 

It is easy to show that g(x) is also shift invariant. Further, 

7W(5(x)) =e|[p(x) I (78) 

= e{[5*(x)-c^0]' I 0 } (79) 

-2/3E{[5*(x)-c^0] \d}+l3^ (80) 

= M{g*{^))-p^ < M{g*{-s.)) (81) 


However, this is impossible since ^*(x) has already 
been shown to minimize the maximum MSE. Thus, the 
assumption that ^*(x) is biased is incorrect. 


Proof of Lemma ^ Using theorem we obtain 
/km ^i/(x|0) dd 


9i{^) = 


/rm /(x| 6 >) d 6 > 


(82) 


and 


5*(x) = 


(83) 


J^M[c^0]f{x\d) dd 

/rm /(x| 0 ) dd 

,S 4 , 


J^M /(x|0) dO 




hence proving the theorem. ■ 

Proof of Theorem Consider the problem of estimating 
c^9 given all the observations xi, • • • , x^. It is easy to show 
that this problem is a vector location parameter problem as 
defined in Section III Hence, a minimax optimum estimator 
for this problem can be obtained via Theorem Denote 
this minimax estimator as /i*(xi, • • • , x^). Further, note that 
/i*(xi,''' ,xx) and ^*(x) are both estimators of c^9, but 
,xx) has more information available to it, since 
Z]/c=i = X. Hence, we must have 


M(g^(x)) > A4(/i*(xi,--- ,xk)) 
Further, using Lemma it can be shown that 


K 


,xk) = 


(85) 


( 86 ) 


k=l 


Since minimax estimators are shift invariant, we can write 


,xk)) 

= sup e| [/i*(xi,--- , xk ) - c '^6]^ I 0] 


= e|[/i*(xi,--- ,XK)f 
This can be further simplified as 
,xk)) 


0 = 


0m| 


= E 


K 


J2hli^k) 


.k=l 


01 = Om, • • • , 0K = Om 


(87) 

( 88 ) 

(89) 


(90) 

(91) 




k=l 


ki = lk2 = l L 
k27^ki 


0ki =OM,0k2 =0m| 


(92) 


We note that /i^(xi), • • ■ , h^ix-x) are all mutually indepen¬ 
dent conditioned on the unknown parameters, due to our initial 
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assumption that xi, • • • , are mutually independent as per 
( [3^ . Hence, we obtain 


■ ■ ■ ,xk)) 


(93) 


= |[ft.^(xfe)] dk=OM^ 

k=l 

K K 

E E =0m} 


^1 = 1 A:2 = 1 

k27^ki 


{(^^ 2 (^* 2 ) ^fc2=0M} (94) 


Since hl{-Kk) is a minimax estimator, it is unbiased and shift 
invariant according to Theorem and hence 


{(lik(xfe) 0 = Om} = 0 , 


E{[(ifei(xfei)]^ = Om} = A4(/ifc(xfc)) 

From ( |^ , ( [^ and ( |^ we obtain 

K 

Finally, from 


k=l 


and ( [ 97 ] ), we obtain 

K 

M{g*{x)) > '^M{hl{xk)) 

k=l 


(95) 

(96) 

(97) 

(98) 


hence concluding the proof. ■ 

Proof of Corollary We shall prove this corollary by 
applying Theorem to POE under the S-model. To this end, 
assume N = KL, where K and L are both integers. For the 
A/'-node network, assuming P pairs of timestamp differences 
are collected per the S-model, the observation vector can be 
written similar to ( p^ , as 


y = dl 2 p + + v^ 


(99) 


where d and 6 represent the unknown fixed delay and phase 
offset, while w represents the 2P x 1 vector of queuing delays. 

Now suppose that the cascade of = iTL nodes is split 
into K smaller cascades, each consisting of L nodes. Each 
cascade of L nodes is placed between a new master-slave pair, 
resulting in K new networks (see example in Fig. 13. Let the 
phase offset of the slave in the network be (5*^, and let 
the fixed delay in the network be d^P. Assume that the 

phase offsets and fixed delays satisfy the relation 

K K 

y2s^’^'> = 6, = (100) 

k=l k=l 

Assuming that P observation pairs are collected per the S- 

model, the observation vector for each L-node network can 
be written, similar to ( p^ , as 

y(/c) ^ (101) 

for k = 1, • • • , . Here represents the 2P x 1 vector 

of queuing delays in the P^ network. Since the single-node 
queuing delays across all nodes are identically distributed, 
the minimax MSB associated with estimating S^P from y^P 


Master 



Node 1 — — — - Node 6 
(a) Original network 


Slave 



Phase Offset: 
S = + 5^2) 



Nodel Node 2 Node 3 Phase Offset: 5^ 


Master 2 . 2 



Node 4 Nodes Node 6 Phase Offset: 5® 

(b) Networks obtained after splitting 

Fig. 2: Example of a network containing N = 6 intermediate 
nodes, that has been split into K = 2 networks, each 
containing L = 3 intermediate nodes. 


will be identical, and equal p{L) in all the L-node networks. 
Note that due to the shift invariance of the minimax estimator 
and the result in Lemma [T] the minimax MSB will remain 
unchanged regardless of the assumption in ( |100| ), since the 
minimax MSE does not depend on the value of S^P or d^P. 
In order to apply Theorem we note that the queuing delay 
vector under the KL node network can be written as sum of 
the queuing delay vectors under each L-node network, i.e. 
w = • Further, due to the assumption that the 

single-node queuing delays are mutually independent, we have 

y.(y (/Cl) ^ y (/C2) |^(/Cl) ^ ^(/Ci) ^ ^(/C2) ^ ^(/C2) ) 

= \S^P\d^P^)f{y^P^\S^P\d^P^). ( 102 ) 

Due to the assumption in ( |100| ), we also have 


y = dl 2 p + + w 

r K 1 




I 2 P + 


lk=l 

K 


■ K 

^S^P 

./c=l 


fe=i 


e + 

1 - 


./c=l 


K 

- 

k=l 


(103) 

(104) 

(105) 


Noting the similarity in the relationships between y, and 
the vectors x, x^ in Theorem]^ we can apply Theorem]^ to 
obtain the relation 


p{KL) > Kp{L) 

which concludes the proof. 


(106) 
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Fig. 4: Performance comparison of different estimators under 
symmetric cross traffic. 
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Fig. 7: Performance comparison of different estimators under 
asymmetric mixed traffic. Traffic models used are TM2 for 
cross traffic and uniform packet size distribution for inline 
traffic. The forward path has 40% inline load and 20% cross 
load, while the reverse path has 20% inline load and 20% 
cross load. 
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symmetric mixed traffic. 
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Fig. 6: Performance comparison of different estimators under 
asymmetric cross traffic. Forward path: 80% Load (TMl), 
Reverse path: 20% Load (TMl). 







































